Accentuation of Adpositions and Particles in a Text-to-Speech System for Dutch

Author

  • Nicole Grégoire
Abstract

In this paper I propose an accent placement algorithm that locates accents on adpositions and particles for use in a Dutch text-to-speech (TTS) system. The algorithm is intended as a refinement of the rule, used in most TTS systems, that accents only content words. Before setting up the algorithm, I discuss when adpositions and particles are accented in Dutch. For this empirical research I used the Spoken Dutch Corpus (CGN). The combination of part-of-speech, syntactic, and prosodic information for approximately 125,000 words in the CGN made it possible to determine whether the accentuation of adpositions and particles depends on their syntactic use within a sentence or on the syntactic use of other constituents in the same sentence. The proposed accentuation algorithm takes a dependency tree with part-of-speech information as input.
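
As a rough illustration of the baseline the abstract refers to (accenting only content words), the following minimal sketch walks a dependency tree with part-of-speech labels and collects the words the baseline would accent. The `Node` class, the coarse tag names, and the example sentence are assumptions made for illustration; they are not the CGN annotation scheme, and this is not the algorithm proposed in the paper.

```python
from dataclasses import dataclass, field
from typing import List

# Hypothetical coarse POS tags treated as content words (an assumption,
# not the CGN tag set).
CONTENT_POS = {"NOUN", "VERB", "ADJ", "ADV"}


@dataclass
class Node:
    """One word in a dependency tree, with its POS tag and dependents."""
    word: str
    pos: str
    children: List["Node"] = field(default_factory=list)


def baseline_accents(root: Node) -> List[str]:
    """Return the words accented by the content-word-only baseline.

    Words are returned in pre-order traversal order of the tree,
    which is not necessarily the surface word order.
    """
    accented = []
    stack = [root]
    while stack:
        node = stack.pop()
        if node.pos in CONTENT_POS:
            accented.append(node.word)
        # Push children reversed so the leftmost child is visited first.
        stack.extend(reversed(node.children))
    return accented


# Hypothetical tree for "hij loopt door het park" ("he walks through the park").
tree = Node("loopt", "VERB", [
    Node("hij", "PRON"),
    Node("door", "ADP", [
        Node("park", "NOUN", [Node("het", "DET")]),
    ]),
])

accents = baseline_accents(tree)
```

Under this baseline the adposition `door` can never receive an accent, whatever its syntactic role; that is the gap a refinement for adpositions and particles would address.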


Similar resources

Automatic accentuation and prosodic phrasing for Dutch text-to-speech conversion

Correct accentuation and phrasing improve the quality of synthetic speech. This paper discusses an algorithm which assigns both sentence accents and phrase boundaries on the basis of the prosodic sentence structure. Although this latter structure is theoretically derived from the syntactic structure, the present algorithm determines the prosodic structure by means of (linguistically and stati...


Cipher text only attack on speech time scrambling systems using correction of audio spectrogram

Recently, permutation multimedia ciphers were broken in a chosen-plaintext scenario. That attack assumes a very resourceful adversary, which may not always be realistic. To show the insecurity of these ciphers, we present a cipher-text only attack on speech permutation ciphers. We show that the inherent redundancies of speech can pave the way for a successful cipher-text only attack. To that end, regularities ...


Evaluation of a sentence accentuation algorithm for a Dutch text-to-speech system

In this contribution an algorithm for the automatic assignment of sentence accents in written Dutch, developed by Kager and Quene, is evaluated experimentally. Results show that the output of the algorithm is judged significantly more adequate than accents randomly distributed over content words but significantly less adequate than the accents produced by a trained news broadcaster and those re...


Information extraction and text generation of news reports for a Swedish-English bilingual spoken dialogue system

This paper describes an experimental dialog system designed to retrieve information and generate summaries of internet news reports related to user queries in Swedish and English. The extraction component is based on parsing and on matching the parsing output against stereotypic event templates. Bilingual text generation is accomplished by filling the templates after which grammar components ge...


Spoken Term Detection for Persian News of Islamic Republic of Iran Broadcasting

Islamic Republic of Iran Broadcasting (IRIB), one of the biggest broadcasting organizations, produces thousands of hours of media content daily. Accordingly, IRIB's archive is one of the richest archives in Iran, containing a huge amount of multimedia data. Monitoring this massive volume of data, and browsing and retrieving this archive, is one of the key issues for this broadcasting...




Publication date: 2004